Towards High Performance Disaggregated Flash Storage with Programmable NIC

نویسنده

  • Yuanwei Lu
چکیده

Existing storage disaggregation solutions all require remote host memory as a proxy to access storage devices. This creates a single bottleneck between CPU root complex and PCIe switch and induces too much DRAM footprint. Moreover, those software solutions introduce high tail latency due to commands batching, system interrupt and context switch. This latency overhead is non-trivial for fast NVMe flash storages. We propose S-Direct to direct access remote NVMe flash storage using a programmable NIC. This new design bypasses remote CPU and doesn’t require host memory as a proxy, thus removing the single bottleneck. S-Direct enables peer-to-peer communication among flash devices which brings a new dimension of flexibility to create new storage access commands. Our preliminary results show that S-Direct achieves up to 8.2x tail latency reduction without throughput drop compared with previous approaches.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hardware NIC Scheduler to Guarantee QoS on High Performance Servers

In this paper we present the architecture and implementation of a hardware NIC scheduler to guarantee QoS on servers for high speed LAN/SAN. Our proposal employs a programmable logic device based on an FPGA in order to store and update connection states, and to decide what data stream is to be sent next. The network architecture is connection-oriented and reliable, based on credit flow control....

متن کامل

Threading Opportunities in High-Performance Flash-Memory Storage

Storage Intensive Supercomputing The storage-intensive supercomputing (SISC) project [1] at LLNL is a research effort that is currently investigating hardware architectures for improving the performance of large, data-intensive applications. In order to achieve nextgeneration performance goals for these applications, it is necessary to consider system architectures that employ both computationa...

متن کامل

Faster DB2 Performance with IBM FlashSystem

I/O performance is critical to DB2 operations. Although you can improve performance by using faster processors, faster systems, and advanced DB2 capabilities, data must still be read from and written to storage. Traditional hard disk drive (HDD) storage cannot provide the speed that is required by many applications. Many solid-state device (SSD) solutions add latency because software is require...

متن کامل

Efficient Flash Memory Read Request Handling Based on Split Transactions

Flash memory is a storage class memory widely used in mobile computing systems due to its small size, low power consumption, fast access time, and high shock and vibration resistance. Flash memory based storage systems exploit chip-level parallelism and hide the latency of flash memory operations through request scheduling. However, conventional scheduling techniques are inadequate for handling...

متن کامل

Integrating Solid State Storage in a PCI Express Clustering Interconnect

Executive Summary Direct PCI Express (PCIe) attached flash storage offers unparalleled performance levels and vastly outperforms SAS/SATA-attached flash storage solutions. However, such PCIe attached flash storage cards, in their existing form, are restricted to the confines of the PCIe slots in the host server. This approach can neither support a disaggregated storage (JBOD/RAID-like) mode tha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017